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We show that it is possible to generalize the Ursell-Mayer cluster formalism so that it may 
cover also the statistics of Internet websites. Our starting point is the introduction of an extra 
variable that is assumed to take account, as will be explained, of the nature of the Internet 
statistics. We then show, following the arguments in Mayer, that one may obtain a phase 
transition-like phenomena. 
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1 INTRODUCTION 

The use of the Internet as a necessary tool for easing the application of an increasing number 
of diverse tasks, and not only for web surfing, is fastly growing. There has been lately an 
ongoing research that discusses the Internet topology [U [2] where use is made of the fractal 
[Bill] and the percolation theories [SUHllZj- In these works the Internet is regarded as random 
network j3J [H] and the websites as its building blocks. 

We focus our attention here on the unique nature of the websites which enables the 
possible existence in them of any number of links that refer to other sites that may be 
downloaded by a single click of the computer mouse. As known, any fractal is built by 
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repeated iteration (see Aharony in [H|) of some unique natural "microscopic" growth rule 
which is rather special in the Internet case. This is because not only the forms of the 
constituent websites, the identity and connectivity |3J EJ E| of their links depend only on 
the programmers that write the relevant softwares but also the growth of the web itself 
depends exclusively upon them. Thus, for taking into account the nature of programming 
that enables one to display on the computer screen practically anything we introduce a special 
variable, denoted in the following by s, that corresponds to the spatial variable r which is 
used to discuss the statistics of the N particle system. The character of this variable s will 
be discussed in the following section. 

We note that similar situations arise in the discussion of some mathematical [E] and 
physical |Hj situations for which one have to add a special variable that takes account of 
some nonphysical properties of the discussed systems. For example, in the functional gener- 
alization [12] of Quantum Mechanics the generalized Hilbert space (the Lax-Phillips one |9J) 
is obtained by adding an extra variable to the conventional Hilbert space. A similar addi- 
tion of an extra variable has been proposed by Parisi-Wu-Namiki [TTJ [T2] in their stochastic 
Quantization theory which assumes that some stochatic process [13] occurs in the extra 
dimension. 

As known, any web site contains one or more web pages and each of these may include 
one or more links to other places on the same page or to other sites or other web servers. 
The user that clicks, through the keyboard, on any link (the highlighted addresses (URL)) 
downloads its relevant site to the screen. The number of the links in a web site may be small 
(or even zero for the unlinked web sites) or it may be very large. We do not consider here 
the secondary links that may be found on the sites referred to by the first links. 

We discuss the connectivity [H El IH IE] among the Internet websites where by this term 
we mean the amount and degree of the interconnection among the sites that compose the 
Internet network. Thus, a large connection among the sites of the Internet denotes a cor- 
responding large connectivity and vice versa. Our aim in this work is to show that if we 
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consider large clusters of mutually linked sites then adding even a very small amount of 
connecting links results in a disproportionally large growth of the total connectivity |3J [E| 
of these clusters. We use, for that purpose, a generalized version of Ursell [14] and Mayer 
O El cluster formalism. This generalization is obtained by the introduction of the re- 
marked extra variable that takes account of the unique Internet statistics. We note that 
similar discussion in Ursell [l4j and Mayer [TH] with regard to the large clusters of particles 
results in demonstrating phase-transition phenomena [Tl* | IT5 j fT6]. 

In Section 2 we discuss the Internet using the partition function method (H3HE1IIII where 
the relevant "configuration integral" is discussed by generalizing the cluster formalism of 
Ursell [H] and Mayer [12]. This generalization is imposed by the unique nature of the Internet 
websites which allows, as will be shown, more than one kind of linkage between the sites. 
Thus, the typical numbering procedure and the combinatorics of the Ursell-Mayer method, 
which was originally formulated for discussing the iV-particle system, has to be appropriately 
expanded. In Section 3 we discuss and generalize the relevant statistical integrals and in 
Section 4 we show the mentioned phase transition of the connectivity among the Internet 
websites. Note that our discussion in this work is general in that we do not specify the 
nature and identity of the relevant sites. 
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2 THE ADAPTATION OF THE URSELL-MAYER CLUS- 
TER FORMALISM FOR THE INTERNET WEBSITES 



2.1 The definitions of the "position", the "distance" and the "po- 
tential" between websites 

We show in this work that the connectivity |UE] in large iV-site clusters, which is determined 
by the links in all the sites, is so sensitive to these connecting links that adding even a small 
number of them to the cluster results in a giant increase of its overall connectivity. This large 
growth of the connectivity, compared to the small addition of links that causes it, constitutes 
a phase transition which should be discussed in the appropriate terms and terminology. For 
that purpose we adapt a generalized version of the virial expansion of the equation of state 
Ej which is discussed by using the cluster formalism developed by Ursell [TJ] and Mayer 
[T5] . We assume that the web sites system discussed here is composed of N sites where N 
is generally a large number. We note that when one discuss the potential energy of the iV 
particle system in the configuration space [TU [TJj the relevant variable is the distance 
between any two particles i and j which depends only on their positions. Thus, the potential 
between them u(ry) is assumed to be effective for small ranges of and vanishes when 
grows. 

Here, also for the N site system we denote by u(sij) the "potential" between the two 
sites i and j which depends upon the "distance", denoted here by s^, between them. This 
"distance" signifies how much these sites differ from each other so that it is shorter the 
more "similar" they are and longer for unrelated sites. The distance between i and j may 
be assigned a Quantitative aspect by taking into account the number of common links to 
both so that the larger this number is the shorter this distance becomes. That is, each 
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site i is characterized by all its links so that its "position" in s space may be written as 
Si(sa, Si2, s^, . . .), where, sn, Si2, Si3 etc denote the links in i to the sites 1, 2, 3, etc. Thus, 
as the real positions in configuration space are measured relative to the origin (zero values 
for coordinates) so the "positions" of the sites here are "measured" relative to the unlinked 
site which has zero link. That is, the more linked some site is the larger is its "distance" 
from the "origin" (the unlinked site). In this context we may use Figure 1 in order to make 
this point clearer. Thus, the unlinked sites denoted [7] and [8] at the right hand side of the 
planar diagram of Figure 1 are at the largest "distance" from each other and from the other 
sites of the figure. On the other hand, the doubly linked sites [15], [16], [23] and [24] in this 
figure have a very small "distance" 

from each other. In such a manner one may write the "distance" between i and j, for 
example, as Sij(s n , s i2 , • • • , Sji, s j2 , ■ ■ ■) = Si{sa, s i2 ■ ■ .)-s j (s jll s j2 , ...). Thus, we may define 
a "potential" u(sy), in analogy to the N particle system, for example, as 



The potential «(%) between the sites % and j does not have a physical meaning but only a 
statistical one. This is because it depends upon the distance between these sites which 
is, as remarked, determined not only by their mutual links (to each other) but also by all 
the other shared and common links which may be very large in number. Thus, this potential 
have a statistical meaning (see the discussion after Eq © and and the Appendix) which 
is expressed using Combinatorical analysis. 

The potential «(%) from the last equation, as for the corresponding ufaj), is effective 
for small "distances" and vanishes when grows. However, in contrast to the N particle 
system in which the distance between any two members i and j is sharply and uniquely 
determined by their positions only here the "distance" between any two sites is also charac- 
terized, as has been remarked, by other links to other sites that are common to both, besides 




1 



(1) 



Si{Sn, Si2, ■ ■ ■ 



) — S j( S jl> S j2) • • • 
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the links that refer to each other. Moreover, since, as remarked, the linking among the sites 
are effected through the highlighted places (upon which one press with the mouse pointer to 
download the linked site to the screen) a site may have such a link to another one whereas 
the second have no link to the first. Thus, in order to discuss appropriately the potential 
between any two sites % and j one must differentiate between four different situations; (1): % 
has a link to j and j has no one to i. (2): j has a link to i and % has no one to j. (3): both 
i and j have no links to each other. (4): both i and j have links to each other in which case 
the connectivity between them is larger than in cases 1 and 2. 

2.2 The "Thermo dynamical" discussion and representation of the 
Internet 

The mentioned possible different kinds of linkings between the sites influences the standard 
expressions and formulas of the Ursell-Mayer expansion jTH] in such a manner that the 
application of it to the Internet statistics becomes, as will be shown, delicate and complicated. 
First, we note that the use of physical thermodynamical methods for discussing the Internet 
structure enables one not only to use the variable s in an analogous manner to the spatial 
variable r but also to use other thermodynamical quantities. We note in this context that the 
use of Thermodynamics terminology and terms for discussing other (nonphysical) branches 
of science may be found in the general literature (see, for example, [HI EH]). Thus, we may 
discuss an appropriately defined partition function Z(s) as well as the related "pressure" P(s) 
and the "free energy" F(s) in the linked cluster of websites. We show by applying these 
fundamental concepts to the Internet that the remarked phase transition of the cluster's 
connectivity may be related to the internetic "pressure" P(s) of its sites. This will be 
demonstrated for large clusters of sites which are doubly connected to each other as in (4) 
above (see the discussion after Eq (0)) in which case adding even a small number of links 
results in a disproportional enormous strengthening of the overall connectivity. We first 
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define the appropriate "partition function" Z(s) that determines the states of the N site 
ensemble and the correlation among them 

N=oo 

Z{s) = £ f(s)C N (V(s)) (2) 

N=0 

As seen, we use the same expression for Z(s) as used in jTHl CZj except for the dependence 
upon the variable s. The f(s) is generally an exponential function that does not depend 
upon the potential V(s) and Cjv(V(s)) is the "configuration integral" in s space which is 
given here by the same form, except for the s dependence, as that of the particle system 

C N {V{s)) = J J ...J e-P* v ^d Sl ds 2 ...d Si ... ds N , (3) 

The variable s denotes that we discuss Internet sites and each of the differentials ds\ds2 ■ ■ ■ ds n 
signifies an infinitesimal "volume" element in s-space. Thus, if we assume, analogously to 
the configuration space, that this space may be projected into three axes denoted by al, 
a2 and a3 then the i differential in Eq (JHJ) may be written as dsi = ds ial ds ia2 ds ia3 . Now, 
since Sij denotes, as remarked, the "distance" between the sites i and j (see discussion before 
Eq {!}) then we may regard the integrals in Eq Q as ranged over all "distances" from a 
reference site which corresponds to the origin of the configuration space. This reference site 
is, as remarked before Eq JIJ), the unlinked site (with zero links). The (3 S in the exponent has 
dimension of "inverse energy" and corresponds to the (3 of the particle ensemble (T5l IT6l ITTj. 
Thus, from Eqs (0)-©, and analogously to the iV particle system [11], one may define the 
appropriate "pressure" P(s) and "free energy" F(s) as 

1 d(\n( c(y(8)) )) 
P(s) = 11 m (A) 

F(s) = A(s) + SP(s) = -- HZ(s)) + ^ 1 Uffl )] (5) 
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Note that the forms of P(s) and F(s) are the same, except for the s dependence, as those 
of the N particle system [12]. The boldfaced S and s in Eqs (JH)-© denote respectively 
the volumes in s space available for iV and one sites and the A(s) in Eq JSJ) is the work 
function in this space which corresponds in its expression to the analogous work function 
or the Helmholtz free energy of the N particle system [THl HEl UZ|- In order to be able to 
use and interpret the Quantity P(s) (and F(s)), and especially to show that it is related 
to the mentioned phase transition, we have first to express C(V(s)) from Eqs J2!)-© in a 
form appropriate for discussing Internet websites. This form is found if we first ascertain the 
proper numbering procedure suitable for the linked sites and which is taken care of by the 
potential V(s). Thus, we write this V(s) as a sum of terms each of which depends on the 
"distance" Sij between any two sites i and j which denotes, as remarked, the connectivity 
between them. Now, since each two sites may be connected to each other in three different 
ways as in the situations labelled (1), (2), and (4) in the discussion after Eq JTJ we get the 
result that in a system of N sites there are 3Af (^ f ~ 1 ) different pairs so that the potential is 

i=N j=N-l i=N j=N-l 

v ( s ) = 12 12 u ( s v) + 12 12 u ( s ij) u ( s ji) ( 6 ) 

i^j j=l i>j j=l 

The first double sum takes account of the N(N — 1) pairs that are singly connected to each 
other and the second covers 

the m^R that 

are doubly connected and u(sij) is the potential 
of the pair of sites % and j as a function of the "distance" Sy between them (see Eq ((IJ)). 
Note that the number of doubly connected pairs is half that of the singly connected ones 
since a single connection between any two sites i and j may be realized in two different ways 
(see the situations (1), and (2) in the former discussion) compared to the double connection 
between them which is realized in only one way. We define, as done when discussing the 
cluster function theory [TJJIIBIIIH, ^ e function 

9H = 9{sij) = e~ u ^ - 1 (7) 
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We see that = for a large "distance" (unrelated web sites) where it(sy) tends to 
zero (see Eq (JTJ)). Note that when the sites i and j have no links to each other (unrelated 
sites) then they, naturally, also have no other common links so that gij and u(sij) are both 
zero. Also, as one may realize the probability to find two sites that are entirely identical to 
each other is very small so that u(sij) in this case becomes very large (see Eq JJJ)) such that 
g(sij) ~ — 1. By we denote the one-way connection from site i to site j when there is a 
link in i to j, and the double connection between them is denoted by g^g^i where, obviously, 
we have (see Eqs JTJ and (JJJ) g^ ^ g^. The "configuration integral" from Eq |j3J) becomes 
using the last equations 

C N (V(s)) = [[■■■ /(I + E 9a + E E 9ij9% + ■ ■ -)dsi ...ds N (8) 

N>ift>l N>i^j>l N>i^'j>l 

Note that due to the special character of the websites, as discussed after Eq (IT}, the counting 
relation between the general sites i and j is as written in the subscripts of the summation 
signs of the last equation (compare with the analogous expression in the N particle system 
(see Eq (13.3) in p. 277 in [Ej)). 

2.3 The plane diagrammatic structure of the linkings among the 
websites 



Now, in order to be able to cope with the terms under the parentheses signs in Eq (JEJ) we use 
an extended version of the one to one correspondence [13 UHl made between the terms of C 
of the iV-particle ensemble and certain planar diagrams. For example, the diagram shown 
in Figure 1 for iV = 24 corresponds to the term in Eq (JHJ that involves g\^ g^, g^, gg t w, 
0n,i2, 011,19, 012,19, 012,20, 013,20, 015,16, 015,23, 016,23 , 016,24, 019,20 and #23,24 (we have inserted 
commas between the i and j components of g^). As seen from the diagram the sites (denoted 
in this paragraph by curly brackets) are assembled in clusters, so that the sites {7}, {8}, 
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{14}, and {22} each constitutes a single cluster of 1 member. The sites {1}{2}, {5}{6}, and 
{17}{18} are in clusters of two where the pairs {1}{2} and {17}{18} are doubly connected. 
The sites {3}{4}{9}{10} and {15}{16}{23}{24} are clusters of 4 where the members of the 
first are singly connected to each other whereas those of the second are doubly connected. 
The sites {11}{12}{13}{19}{20}{21} constitute a six-member cluster. As seen from Figure 
1 the sites in a cluster are connected in a different manner to each other so that one may 
be doubly connected to all the other sites of the cluster in which case its connectivity is 
maximal whereas another may be singly connected to only one site so that its connectivity is 
minimal. For example, in the six member cluster of Figure 1 the site {20} is doubly connected 
to the sites {11}, {12}, and {19} and singly connected to {13} and {21}, whereas the sites 
{13} and {21} are each singly connected to only the site {20}. This difference between the 
connected sites and the less connected ones is realized when one remove from the cluster one 
site and all its connecting lines. Thus, if the removed site is the densely connected one the 
connectivity of the remained cluster is weakened considerably whereas if the removed site is 
slightly connected the effect on the connectivity of the remaining cluster is less influential. 
For example, if the site {20} and all its connecting lines in the six-member cluster of Figure 
1 is removed the cluster is actually broken into three different smaller ones whereas if either 
the site {13} (or {21}) is removed together with its single connecting line the connectivity 
of the remaining 5-member cluster is only slightly affected. 

We note that Figure 1 is reminiscent of Fig 13.1 in page 278 in [TH] which shows a similar 
diagram for 28 molecules. The principal difference between the two diagrams is that in 
Figure 13.1 in [03 the connection, if exists, between any two molecules is unique and not 
directed as in the plane diagram of Figure 1 here. That is, as remarked, the linking between 
any two sites may be of three kinds and the lines that connect them reflect this diversity of 
connection. 
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Figure 1: An example of the planar diagram that maps the terms of the configuration integral 
from Eq (jHJ). Shown in the diagram is one way, from a large number of possible ones, of 
connecting 24 sites into clusters where the linking between any two sites, if exists, may be 
of the single connection type or the double one. 

2.4 The ordering of the sums of the integrals of Cn(V(s)) into clus- 
ters Q 

We denote the number of times an /-site cluster appears in a term by mi so that the total 
number N of all the sites may be written, as for the analogous N particle system [TH], as 
iV = Imi. Note that the integrals over the sites in different clusters of one term in Eq 
(jHJ) are independent of each other. Thus, the integral of a term in Eq (jHJ) is the product of 
the integrals over the sites in the same cluster where the meaning of the integral over the s 
variable is as discussed after Eq Q. As for the iV particle system [TH] we sum the integrals 
of all the products that occur when a specified /-sites are in one cluster and, because of the 
different counting of Internet websites, we divide this sum, denoted by q, into two different 
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parts as 



1 

Q = ci m +c ld = — 




XX II 9a+ II 9ij9ji)ds 1 . . .ds, 



(9) 



where q m , q d denote the products over the mixed and doubly connected sites respectively. 
By the term mixed we mean that the sites in the relevant cluster may be either only singly 
connected or both singly connected (to some sites) and doubly (to others, see the example 
of c 3 in Eq ifTHj) ). Note the different counting relations between the general sites i and j for 
the mixed and doubly connected sites as expressed respectively in the subscripts of the two 
product signs of Eq (JUJ). The counting relation of I > i > j > 1 for the doubly connected 
sites, which is the same as in the analogous iV particle system (see p. 277 in [15]), is because 
this kind of connection is unique and does not depend on direction. The number of gij in 
each term of the q m ranges from a minimum of (I — 1) to a maximum of (1(1 — 1) — 1) and 
the corresponding number in the q d part ranges from a minimum of 2 (/ — 1) to a maximum 
of 1(1 — 1) where these numbers are always even for q d . The product US is a normalization 
factor where S is, as remarked, the total volume in s-space of all the N sites. For example, 
c 3 is 

c 3 = c 3d + c 3m = J J J ds 1 ds 2 ds 3 [(g 31 g 13 g 2 igi2 + 5-32^23^31^13 + 
+932923921912 + 932923931913921912) + (4# 3 i#2i + 4#32#3i + 4# 32 # 21 + 8g 32 g 3 ig2i + 
+4=93i9i392i + 4g 32 g 23 g 3 i + 4g 21 g 12 g 32 + 4.g 32 g 23 g 31 g 21 + 4.g 31 g 13 g 32 g 21 + (10) 
+4g 2 igi 2 g 3 ig 32 + ^931913921912932 + 2g 32 g 23 g 31 gi 3 g 2 i + 2# 32 #23S f 2iS f i25 , 3i)] 

The c 3 d is given by the first four terms and c 3m is given by the rest. Note that since each 
mixed connection between any two sites may be, as remarked, of three kinds there are 50 
terms in c 3m whereas only 4 terms in c 3d because the double connection is unique. We thus 
see that q becomes very large for increasing values of I as may be realized from Appendix A 
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in which we use combinatorical analysis for calculating the number of terms in q for general 
/. 

3 THE "CONFIGURATION INTEGRAL" C N (V(s)), THE 
CLUSTER INTEGRALS q AND THE IRREDUCIBLE 
INTEGRALS ( k 

3.1 The expression of Cjv(V(s)) in terms of c\ 

From the discussion of the former section we realize that the total contribution to the "con- 
figuration integral" from each specific /-cluster is [15J 

C{V{s)) 

specific I— cluster n(/!SQ) mi , (ii) 

i 

which is the expression obtained also for the N particle system (see p. 280 in [15] ) except 
that here we use the volume in s space. The product (/!S) has been introduced to cancel the 
effect of the normalization constant (see Eq © and the discussion following it) and mi is, 
as noted, the number of times an /-cluster appears in a term. Now, we add all the similar 
contributions from the other /-site clusters which is obtained by mutiplying the expression 
from the last equation by which is the number of ways to distribute N different sites 

into clusters so that m\ clusters have one site each, m 2 clusters have two sites each ... and 
mi clusters have / sites each. The division by Y[i m i^- ( an d not by Yli m iKty mi as f° r the N 
particle system (see page 281 in (Hp)) is because of the unique nature of the websites which 
are not only different from each other but also render their clusters, even those with the same 
number of sites, different. That is, the division Yli m i^- takes into account the permutations 
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of the sites inside the clusters and also the permutations of the clusters that have identical 
number of sites. The overall contribution from all the / site clusters is therefore 

ay(s)u ~ = n = n JV!(Sc ' )m, ,' !( """ 1) (i2) 

The last step is to sum over all values of m ; so that we obtain for the total value of the 
"configuration integral" divided by iV! 

C(V(s)) (sATq)^!^- 1 ) 

where we have used the relation s = -j| which is the volume in s space per site. As seen from 
the last equation the indices mi and I over which the sum and the product are respectively 
run must satisfy the condition [TH] — N (see the discussion before Eq (JH|). 

3.2 The ordering of the cluster integrals q into irreducible integrals 

a 



As realized from the last equations, there are many terms in q which represent clusters that 
are composed of two parts that are connected by only one link, so that removing it splits 
the relevant integral into a product of two. Thus, the cluster integrals q's may be simplified 
as in the particle system [13 [THJ by representing them as sums of integrals that can not 
be further reduced into a product of integrals. That is, these irreducible integrals, denoted 
as (k, correspond to clusters all the members of which are more than singly connected as 
follows 

& = wo //"••• II 9ijds! . . . ds k+1 , (14) 

K! ° J J J k+l>i^j>l 
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where, the product is over all the sites each of which is connected to at least two other sites. 
Note that since, as remarked, the connection between any two sites i and j may be of more 
than one kind (see the discussion after Eq (JTJ) the counting relation is as written in the 
subscript of the product sign (and not £; + l>z>j>l which is appropriate for the N 
particle system (see p. 287 in [TH])). As for the q from Eq (JUJ) we differentiate between Cfc m 
and Qk d which denote mixedly and doubly connected sites respectively where by the term 
mixedly we mean that the sites in the relevant cluster may be, as for the q mJ either only 
singly connected or both singly connected (to some sites) and doubly (to others). From Eqs 
0, © and (JT1J) we see that the terms of q m and Cfc m may be positive or negative depending 
upon the evenness or oddness respectively of the numbers of in these terms. The terms 
of q d and (k d , on the other hand, are always positive since the number of their ^ is always 
even. 

The first two (k, for example, are 



The coefficients of 2, 6, and 12 signify the number of similar terms that differ only by the 
direction of the connecting lines between the sites. Thus, the term 6<7 32 <73i<72i denotes the 
six possible different ways to construct a 3-cluster from three singly connected sites since 
each of these sites may be connected in two different ways to its neighbour. Likewise, the 
term I2g 32 g2393i92i denotes the 12 different possible ways to construct a 3-cluster from two 
doubly connected sites which are singly connected to the third. This is because there are 
3 different ways to select the two doubly connected sites from three and for each of these 
there are four different ways to singly connect these two sites to the third one. Now, we can 
write, analogously to the N particle system [11], any cluster integral q as a sum of terms 



Ci = g J J ( 2 9i2 + 9i292i)ds 1 ds 2 

C2 = -jg J J J (6#32#31#21 + 12#32523#31#21 + 6932923931913921 + (15) 
+g32g2393igi392igi2)ds 1 dS2ds 3 
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each of which depends upon the powers rik of the integrals Cfc where and k are related 
by Z)fc=i _1 krik = 1 — 1. The (—1) in the last relation is because one site is left over due to 
the definition of (k (see the discussion before Eq (jTH) ). Thus, using the arguments in the 
Appendix in [TH] (see p. 455-459 there) with respect to the N particle system we write the 
following dependence of c\ upon ^ 

where we take into account that the number of ways to distribute I different objects into 
clusters of k objects each (with X^ti" 1 kn>k — (I — 1)) is rrri — r The expression (fTH|) results 
also from considering the three possible connections between any two sites that introduce 
different counting (see the discussion before Eq (|T2|) and after (fl~5j) and note the similarity 
between Eqs (fTTfll and (jTT)jl). 

3.3 The general term of Cn(V(s)) as a function of q and that of q 
as a function of (k 



We follow in this subsection the same steps and the same expressions, used by Mayer [TH] 
in his demonstration of phase transition for the iV particle system, except for the s variable 
and the mentioned generalization necessary for discussing the Internet websites. That is, 
we demonstrate, using the following equations (fl~7| - (|23|) . that the overall connectivity of the 
doubly linked cluster may undergoes phase-transition. 

We begin by using the Stirling approximation [2Q] for In/!, lnm ; !, ln/c! and lnn fc ! so one 
may write the logarithms of the general terms of the sums of Eqs (TH^ and (JTK|) . denoted by 
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lnr c and lnl^ respectively as (compare with Eqs (13.13) and (14.2) in (THJ ) 



l=N 



lnT c = y~] (mi ln(siVq) + (mi — l)(/ln / — /) — m\ lnm^ + m{) 
i=i 

k=l-l 



In T c = ( n k l n (Cfc) + i n k ~ 1) (k In k - k) + n k In / - n k In n k + n k ) 

k=l 



(17) 



-ln3 - 21n/ 



Now, if all the ( k s are positive one may replace [TH], for large values of /, the logarithm of 
the sums in Eqs (fT31) and (|Tfil) by that of their largest terms. These are obtained from Eqs 
(fT7|l by subtracting from the first of which the constant (— InZ) multiplied by the condi- 
tion YhZi l m i = N and from the second the constant (— lnp) multiplied by the condition 
J2k=]~ l kn k = (I— 1). These kinds of operation and the denotation of the constants as (— In Z) 
and (— lnp) are done, as in [11], for a clear representation of the relevant calculations. 

We, now, differentiate the first of the resulting expressions (related to In Y c ) with respect 
to mi and the second (related to lnl^) with respect to n k and equate to zero so that the 
values of mi and n k that maximize In T b and In respectively are 



Note that, due to the nature of q and ( k (see Eqs © and (fTi^ . both Z and p have dimension 
of inverse s. Substituting for n k in the second of Eqs (fT7]l and regarding the resulting 
expression, in the limit of very large /, as representing the logarithm of the sum in (JT6| one 
obtains, neglecting the terms (—In 3) and (—2 In/) compared to / and using the condition 
for large / kn k = I (instead of kn k = (I — 1)) 



mi = sNcil l e 1 Z\ n k = l( k k k e k p k 



(18) 



k=i-i 




lim lnl^ = lim lnq = lim /( V] ( k k k e k p k — lnp) = lim / In 6, 



(19) 
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where In bo = Y,k=i 1 (kk k e k p k — In p, which leads to 



Compare with the analogous expressions (14.6)-(14.7) in [TJ)| for the N particle system. From 
the last two equations one obtains, in analogy to the N particle system |Tj 



* = 0*4, (2i) 

where f(l, Q is small compared to I so that it satisfies lim^oo ln ^'^ = 0. Now, the condition 
Y,\Zi l m i — N may be written, using the first of Eq (fT%l) . as 

l=N l=N 

J2lmi=Yl Nsl l+1 e~ l ciZ l = N (22) 
i=i i=i 

Substituting for q from Eq (|2*T|) into the last equation, dividing by N, and using for large / 
the approximation w l l one obtains 

l=N 

^s/(/,C)(/e- 1 6 ^) i = l (23) 
l=i 

The last equation, as shown in the next section, is central in demonstrating that enlarging 
the remarked connectivity even slightly, by adding the same links to them, results in an 
enormous strengthening of the overall connectivity. 
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4 THE PHASE TRANSITIONAL STRENGTHENING 
OF THE CONNECTIVITY AMONG THE INTER- 
NET WEBSITES 

4.1 The results for a cluster of I = N = 10 5 sites 

We, now, assume that the number of sites I of the /-cluster is large and that they are doubly 
connected to each other, in which case both q and £fc are, as remarked, positive due to the 
even number of their g^ . Thus, dividing both sides of Eq by s and taking the logarithm 
of the term that corresponds to I = N we obtain for this term (denoted T^) 

\nT N = \nf(N,() + N\n(Ne-\Z) (24) 

The expression NboZe -1 may be written as e e so that if e = then Nb$Ze~ l = 1, and 
when e is negative Nb^Ze' 1 is smaller than unity and the relevant term Nlia^Ne^boZ) in 
Eq (|2H) is negative. In this case since it is the dominant term In Tat on the left hand side 
of Eq |24l) is also negative, which implies that IV is smaller than unity and its contribution 
to the sum in Eq (|23|) is small. Thus, for exemplifying the remarked giant increase of the 
connectivity we assign to N the value of N = 10 5 and take into account the former result of 
lim^oo ln f( 1 '^ = 0, which requires ln/(iV, Q to be much smaller than N. So, if e increases 
from zero by only the small amount of 10~ 4 , due to adding a small Quantity of linked sites 
to the cluster, the value of In Tat grows, as a result of this, by 10 1 and the contribution of 
to the sum increases by 10 10 . That is, adding even a small number of connecting links to a 
large cluster of connected sites results in a disproportional strengthening of the connectivity 
of this specific cluster. 
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The previous results may be realized from the two Subfigures of Figure 2 which show 
ln(rV) and from Eq (f2~i|) as functions of e in the range —0.01 < e < 0.01 and for 
iV = 10 5 . The left hand side Subfigure shows the change of ln(rjv) = TVe from Eq (|2~H) 
in the neighbourhood of e = where we have written Nb^Ze' 1 = e e and ignore the term 
ln(/(iV, ()) which vanishes for large N. Thus, one may see that ln(rV) is proportional to e 
where the coefficient of proportion is N. But the behaviour of Tn = e Ne (compared to that 
of In (Fat)) is not the same for positive and negative e as may be seen from the right hand 
side Subfigure of Figure 2. That is, although negative e may produce large negative values 
of In (Fat) as seen in the left hand side subfigure it yields a rather negligible change of T N . 
But when e departs slightly from zero towards positive values the produced change in Tn 
becomes so enormous that even the small change of 5e « 0.006, yields the giant change of 
Ar^v ~ 10 ■ 10 303 . That is, the overall connectivity of the large cluster has undergones, as 
remarked, a phase transition change. 

4.2 The inverse proportionality between the connectivity in a large 
/ cluster and the "volume" per site in s space 

Now, remembering that Eq (|2~i|) was obtained after dividing both sides of Eq (|23|) by s 
we have to conclude, in order to retain Eq that s must decrease in this case to a 

correspondingly very small value where, as noted, s is the "volume" in s space per site. For 
a better understanding of the meaning of a small s we return to Eq (J1J) for the "pressure" 
P(s) and evaluate it in the limit of large I. In such case one may replace, as remarked, 
m(^pi), where is given by Eq (UJ), by ln(r c ) from the first of Eqs JTIJ). Thus, 

using mi = sNcil l e~ l Z l sNcil^ l+1 ^e^ 1 Z l and J2^ m i = N (see the first of Eqs (fT£l) and Eq 
((HI ) we may evaluate the "pressure" P(s) from Eq (j3J as follows 



1 <9(ln( 



C(V(s)) 



)) 



i a(in(r c )) 

(3 S N ds 




Nl 



(25) 



(3 S N 



ds 
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Figure 2: The left and right subfigures show respectively the change of mr^ and T N from 
Eq ((2H in the neighbourhood of e = 0. Both graphs are plotted for N = 100000 and for the 
range of 5e = (—0.01, 0.01) where any two neighbouring values of e differ by Note the 
giant change of Ar^ = 10 • 10 303 for the slight increase of 5e ~ 0.006. 



1 <9(ln(£z=i V (mi(\n(sNci) + (llnl - I) - mm, + 1) - I (In I - 1)))) 
&iV ds ~ 

1 9(ln(Ej=f [sN Cl l l e- l Z l {l - \a{l l e~ l Z l ) + /(In I -I))- /(In / - 1)))) 



(3 S N 



ds 



1 9(ln(E{=f (siVcjZ'e- 1 ^ -/(In/- 1)) -AHnZ)) 1 iV 



5s 



Substituting the last result in the expression for the "free energy" from Eq (jHJ we obtain 
F(s) = A(s) + SP{s) = - j- \n(Z(s)) + J^. Note that although q contains the total "volume" 
S in its denominator (see Eq Q) it certainly does not depend on it since the integrals of q 
always lead to a factor S that cancels that in the denominator. Thus, from the last equations 
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we see that the "pressure" P(s) (and the "free energy" F(s)) becomes very large for very 
small s. The meaning of small s may now be understood by recalling that the sites of the 
relevant linked cluster are related to each other by the variable s that denotes the "distance" 
between them in the sense of how much they are similar to each other (see the discussion 
before Eq (jTJ). That is, small s for the large cluster of doubly linked sites means that the 
"distances" that signify the differences between them become also very small and they all 
turn out to be similar to each other. The vanishing "distances" between the sites in this 
case results in high values for the "pressure" and the "free energy" (see the last equations) of 
these "jammed" sites. This occurs, as remarked, when adding a small amount of links to the 
larger terms of the sum in Eq (j2Tfl) which results in the outcome that the overall connectivity 
among the component sites becomes maximal in the sense that they have the same links and 
so they are similar to each other. 

Note that the former discussion depends on the assumption that all the Cfc are positive 
whereas we know that the signs of q and Cfc alternate due to the evennes and oddness of 
the number of ^ (see the discussion after Eq (jH])). But if we confine our discussion to 
only the doubly connected sites then all the q and (k are, as remarked, positive (in addition 
to their being mutually connected as required) so all the former discussion is obviously 
valid. That is, the doubly connected clusters may show this kind of phase transition-like 
of the connectivity by only adding a very small amount of connecting links. Note that the 
mathematical expressions of the doubly connected N site system are the same as the N 
particle system discussed in jTH] except that in the websites system all the c\ d and C^ d are 
positive whereas they alternate in sign for the particle system. Also, this kind of avalanche (or 
condensation as termed in [15) ) have been shown in (THJ for the N particle system provided 
one assumes that all the Cfc are positive (as here). 

We note that the process just described of a large increase in the fraction of the connec- 
tivity in linked large-size clusters is frequently encountered in ensembles of shared computers 
connected to each other and to the Internet. In this case an additional web site in the form 
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of a file that has been programmed intentionally for the purpose of adding and connecting 
it to the other files of the shared ensemble enhances the connectivity of these members to 
a very high degree. For example, suppose that all these members use some utility program 
and that not all of them have the same version of it but use several ones. Thus, adding an 
updation file of this program to them turns all the different versions into one that is common 
to all thereby increasing very much the total connectivity of them. 

5 CONCLUDING REMARKS 

We have discussed the Internet statistics using a generalized version of the Ursell-Mayer 
cluster formalism [HI [H3 HBJ. We discuss the relevant elements of the Internet, which 
are the websites, by introducing a new variable that takes account of the unique nature 
of them. The special character of the Internet, unlike any other common fractal web, is 
especially demonstrated through the links in its websites which may be of several kinds 
as discussed after Eq JlJ). Taking into account this unique character and introducing the 
relevant expressions into the Ursell-Mayer framework we have shown that one may obtain 
a phase-transition phenomena. This occurs when a large cluster of doubly linked sites are 
added an extra small amount of connecting links in which case the contribution of this cluster 
to the overall connectivity of the entire ensemble becomes enormous. These phase transition 
changes were also seen to characterize the "pressure" P(s) and the "free energy" F(s). 
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A APPENDIX: THE COMBINATORICS OF THE l- 
SITE CLUSTER 



We calculate in this Appendix the number of terms contributed to the "configuration integral" 
by any specific c\. This contribution is valid only when the sites in the same cluster are linked 
to each other and is obtained by exploiting the special character of the q from Eq (JHJ in 
order to find all the possible ways by which it contributes to the configuration integral. We 
take into account that the number of in the terms of q ranges, as noted, from (/ — 1) 
to /(/ — 1) and that each g^ denotes, as remarked, a link to site j in site i. Thus, the 
contribution of each q may be obtained by calculating the number of possible ways by which 
each of the Quantities of links (I — I), I, . . .1(1 — 1), may link / sites among them so as to 
construct an /-site cluster. We begin from the least linked /-cluster which is constructed by 
using only (/ — 1) different links. The number of ways to construct such cluster is (we denote 
this number Wi(l — 1)) 



W t (l-1) 



1 1(1-1)^ 
(J-l) j 



W-W (Al) 



(/-!)!(/ -I) 2 )! 



( *(*-!) ^ 

in the middle expression is the number of combinations required to construct 

this least /-site cluster where there are no two links in any of these combinations that refer 
to each other such as and g^. Now, since, as remarked, the number of in the terms 
of Eq ((OJ) ranges from (I — 1) to /(/ — 1) we have, in order to find all the possible different 
contributions of each q, to calculate also the Wi(l), Wi(l + 1) . . . W\(l(l — 1)). Each of these 
is found by first constructing the minimally connected /-cluster from (/ — 1) sites as in Eq 
(A.l) and since each site in the /-cluster may be linked to all the other (/ — 1) sites, except 
to itself, the next step is to link each of the sites to all the others. That is, the total number 
of ways to compose an / cluster using a number of links that ranges from (/ — 1) to /(/ — 1) 
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is W t (l - 1) + Wi(l) + ... + Wi(l + k) + ... Wi(l(l - 1)). Wi(l - 1) in the former sum is given 
by Eq (A.l) and each one of the other Ws is obtained by noting that after composing the 
minimally linked /-cluster using the (I — 1) links, from the possible 1(1 — 1), one remains 
with 1(1 — 1) — (I — 1) = (I — l) 2 links. These links, in contrast to the former least linked I 
cluster, may be formed so that it is allowed to count also mutually linked sites such as gij 
and gji. That is, for calculating the number of ways to link these (I — l) 2 additional links 
we may permute them instead of the former combinations used for the initial (I — 1) links. 
Thus, as remarked, for calculating Wi(l), . . . Wi(l + k), . . . Wi(l(l — 1)) we first construct the 
least linked / cluster, using (I — 1) links as in Eq (A.l), and then we link the remaining links 
which range from 1 for Wi(l) to (I — l) 2 for Wi(l(l — 1)). For example, the number of possible 
ways to construct an / site cluster using (/ + k) links, where < k < 1(1 — 2) is given by the 
recursive relation 

Wi(l + k) = W t (l + k - 1) • (1(1 - 1) - (I + k - 1)) = Wi(l + k - 1) • ((I - l) 2 - k) (A.2) 

The factor (1(1 — 1) — (/ + k — 1)) in the middle expression is the number of ways to link 
the remaining links, from the initial possible 1(1 — 1), after linking (/ + k — 1) links. When 
k = we obtain from Eq (A.2) Wi(l) = W\(l — 1) ■ (I — l) 2 . From the last equation one 
obtains the number of ways for using I links, from a total of /(/ — 1), to construct an / 
cluster where the first (/ — 1) links are used to initially compose the least linked cluster 
and the additional link may be anyone from the remaining (/ — l) 2 . When k = 1(1 — 2) 
one obtains from Eq (A.2) Wi(l(l — 1)) = W\(l(l — 1) — 1). Now, we can calculate the sum 
Wi(l - 1) + Wi(l) + . . . + Wi(l + k) + . . . + Wi(l(l - 1)) in order to find the number of ways 
by which one may build an /-site cluster. That is, we write, using the former equations 

W t (l - 1) + W t (l) + . . . + W[(l + k) + ... + W t (l(l - 1)) = 



= Wi(Z -!)[! + (/- 1) 2 [1 + ((I - l) 2 - 1) + ((I - l) 2 - !)((/ - l) 2 - 2) + 
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m=k 



m=l(l-l) 



+•••+ n (c- 1 ) 



n (C-l) 



2 



m)]] 



(A3) 



m=l 



m=l 



fe=i(J-l) m =fc 



wi(z-i)[i+(/-i) 2 [i+ e n^-i) 2 



m)]] 



fc=l m=l 



The last result is the number of terms of q and it is very large for large I. 
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